RAW SEQUENCE LISTING

PATENT APPLICATION: US/09/134,333C



DATE: 07/24/2003 
TIME: 08:08:58 



Input Set : A:\066001350XCIP-a.ST25.txt 
Output Set: N:\CRF4\07242003\I134333C.raw

3 <110> APPLICANT: LONGACRE-ANDRE, SHIRLEY

4 ROTH, CHARLES 

5 NATO, FARIDABNO 

6 BARNWELL, JOHN 

7 MENDIS, KAMINI 

9 <120> TITLE OF INVENTION: RECOMBINANT PROTEIN CONTAINING A C-TERMINAL FRAGMENT OF 
PLASMODIUM MSP-1 

11 <130> FILE REFERENCE: 0660-0135-OXCIP 

13 <140> CURRENT APPLICATION NUMBER: 09/134,333C

14 <141> CURRENT FILING DATE: 1998-08-14 

16 <150> PRIOR APPLICATION NUMBER: PCT/FR97/00290 

17 <151> PRIOR FILING DATE: 1997-02-14 

19 <150> PRIOR APPLICATION NUMBER: FR96/01822 

20 <151> PRIOR FILING DATE: 1996-02-14 
22 <160> NUMBER OF SEQ ID NOS : 15 

24 <170> SOFTWARE: PatentIn version 3.1

26 <210> SEQ ID NO: 1 

27 <211> LENGTH: 291 

28 <212> TYPE: DNA 

29 <213> ORGANISM: Artificial Sequence

31 <220> FEATURE: 

32 <223> OTHER INFORMATION: synthetic DNA 

34 <220> FEATURE: 

35 <221> NAME/KEY: CDS 

36 <222> LOCATION: (1)..(291) 



37 <223> OTHER INFORMATION:
W--> 39 <400> 1

40 gaa ttc aac atc tcg cag cac caa tgc gtg aaa aaa caa tgt ccc gag 48

41 Glu Phe Asn Ile Ser Gln His Gln Cys Val Lys Lys Gln Cys Pro Glu

42 1 5 10 15

44 aac tct ggc tgt ttc aga cac ttg gac gag aga gag gag tgt aaa tgt 96

45 Asn Ser Gly Cys Phe Arg His Leu Asp Glu Arg Glu Glu Cys Lys Cys

46 20 25 30

48 ctg ctg aac tac aaa cag gag ggc gac aag tgc gtg gag aac ccc aac 144

49 Leu Leu Asn Tyr Lys Gln Glu Gly Asp Lys Cys Val Glu Asn Pro Asn

50 35 40 45

52 ccg acc tgt aac gag aac aac ggc ggc tgt gac gca gac gcc aaa tgc 192

53 Pro Thr Cys Asn Glu Asn Asn Gly Gly Cys Asp Ala Asp Ala Lys Cys

54 50 55 60

56 acc gag gag gac tcg ggc agc aac ggc aag aaa atc acg tgt gag tgt 240

57 Thr Glu Glu Asp Ser Gly Ser Asn Gly Lys Lys Ile Thr Cys Glu Cys

58 65 70 75 80

60 acc aaa ccc gac tcg tac ccg ctg ttc gac ggc atc ttc tgc agc taa 288

61 Thr Lys Pro Asp Ser Tyr Pro Leu Phe Asp Gly Ile Phe Cys Ser

62 85 90 95

64 taa 291

67 <210> SEQ ID NO: 2

68 <211> LENGTH: 95

69 <212> TYPE: PRT

70 <213> ORGANISM: Artificial Sequence

72 <220> FEATURE:

73 <223> OTHER INFORMATION: synthetic peptide

75 <400> SEQUENCE: 2

77 Glu Phe Asn Ile Ser Gln His Gln Cys Val Lys Lys Gln Cys Pro Glu

78 1 5 10 15

81 Asn Ser Gly Cys Phe Arg His Leu Asp Glu Arg Glu Glu Cys Lys Cys

82 20 25 30

85 Leu Leu Asn Tyr Lys Gln Glu Gly Asp Lys Cys Val Glu Asn Pro Asn

86 35 40 45

89 Pro Thr Cys Asn Glu Asn Asn Gly Gly Cys Asp Ala Asp Ala Lys Cys

90 50 55 60

93 Thr Glu Glu Asp Ser Gly Ser Asn Gly Lys Lys Ile Thr Cys Glu Cys

94 65 70 75 80

97 Thr Lys Pro Asp Ser Tyr Pro Leu Phe Asp Gly Ile Phe Cys Ser

98 85 90 95

101 <210> SEQ ID NO: 3 

102 <211> LENGTH: 279 

103 <212> TYPE: DNA 

104 <213> ORGANISM: Plasmodium falciparum 

106 <400> SEQUENCE: 3 

107 aacatttcac aacaccaatg cgtaaaaaaa caatgtccag aaaattctgg atgtttcaga 60 
109 catttagatg aaagagaaga atgtaaatgt ttattaaatt acaaacaaga aggtgataaa 120 
111 tgtgttgaaa atccaaatcc tacttgtaac gaaaataatg gtggatgtga tgcagatgcc 180 
113 aaatgtaccg aagaagattc aggtagcaac ggaaagaaaa tcacatgtga atgtactaaa 240 
115 cctgattctt atccactttt cgatggtatt ttctgcagt 279 

118 <210> SEQ ID NO: 4 

119 <211> LENGTH: 354 

120 <212> TYPE: DNA 

121 <213> ORGANISM: Artificial Sequence 

123 <220> FEATURE: 

124 <223> OTHER INFORMATION: synthetic DNA 

126 <220> FEATURE: 

127 <221> NAME/KEY: CDS 

128 <222> LOCATION: (1)..(354)

129 <223> OTHER INFORMATION:
W--> 131 <400> 4

132 gaa ttc aac atc tcg cag cac caa tgc gtg aaa aaa caa tgt ccc gag 48

133 Glu Phe Asn Ile Ser Gln His Gln Cys Val Lys Lys Gln Cys Pro Glu

134 1 5 10 15

136 aac tct ggc tgt ttc aga cac ttg gac gag aga gag gag tgt aaa tgt 96

137 Asn Ser Gly Cys Phe Arg His Leu Asp Glu Arg Glu Glu Cys Lys Cys 

138 20 25 30 

140 ctg ctg aac tac aaa cag gag ggc gac aag tgc gtg gag aac ccc aac 144 

141 Leu Leu Asn Tyr Lys Gln Glu Gly Asp Lys Cys Val Glu Asn Pro Asn

142 35 40 45

144 ccg acc tgt aac gag aac aac ggc ggc tgt gac gca gac gcc aaa tgc 192

145 Pro Thr Cys Asn Glu Asn Asn Gly Gly Cys Asp Ala Asp Ala Lys Cys

146 50 55 60

148 acc gag gag gac tcg ggc agc aac ggc aag aaa atc acg tgt gag tgt 240

149 Thr Glu Glu Asp Ser Gly Ser Asn Gly Lys Lys Ile Thr Cys Glu Cys

150 65 70 75 80

152 acc aaa ccc gac tcg tac ccg ctg ttc gac ggc atc ttc tgc agc tcc 288

153 Thr Lys Pro Asp Ser Tyr Pro Leu Phe Asp Gly Ile Phe Cys Ser Ser

154 85 90 95

156 tct aac ttc ttg ggc atc tcg ttc ttg ttg atc ctc atg ttg atc ttg 336

157 Ser Asn Phe Leu Gly Ile Ser Phe Leu Leu Ile Leu Met Leu Ile Leu

158 100 105 110

160 tac agc ttc att taa taa 354

161 Tyr Ser Phe Ile

162 115

165 <210> SEQ ID NO: 5

166 <211> LENGTH: 116

167 <212> TYPE: PRT

168 <213> ORGANISM: Artificial Sequence

170 <220> FEATURE:

171 <223> OTHER INFORMATION: synthetic peptide

173 <400> SEQUENCE: 5

175 Glu Phe Asn Ile Ser Gln His Gln Cys Val Lys Lys Gln Cys Pro Glu

176 1 5 10 15

179 Asn Ser Gly Cys Phe Arg His Leu Asp Glu Arg Glu Glu Cys Lys Cys

180 20 25 30

183 Leu Leu Asn Tyr Lys Gln Glu Gly Asp Lys Cys Val Glu Asn Pro Asn

184 35 40 45

187 Pro Thr Cys Asn Glu Asn Asn Gly Gly Cys Asp Ala Asp Ala Lys Cys

188 50 55 60

191 Thr Glu Glu Asp Ser Gly Ser Asn Gly Lys Lys Ile Thr Cys Glu Cys

192 65 70 75 80

195 Thr Lys Pro Asp Ser Tyr Pro Leu Phe Asp Gly Ile Phe Cys Ser Ser

196 85 90 95

199 Ser Asn Phe Leu Gly Ile Ser Phe Leu Leu Ile Leu Met Leu Ile Leu

200 100 105 110

203 Tyr Ser Phe Ile

204 115

207 <210> SEQ ID NO: 6

208 <211> LENGTH: 342

209 <212> TYPE: DNA

210 <213> ORGANISM: Plasmodium falciparum

212 <400> SEQUENCE: 6

213 aacatttcac aacaccaatg cgtaaaaaaa caatgtccag aaaattctgg atgtttcaga 60
215 catttagatg aaagagaaga atgtaaatgt ttattaaatt acaaacaaga aggtgataaa 120
217 tgtgttgaaa atccaaatcc tacttgtaac gaaaataatg gtggatgtga tgcagatgcc 180
219 aaatgtaccg aagaagattc aggtagcaac ggaaagaaaa tcacatgtga atgtactaaa 240
221 cctgattctt atccactttt cgatggtatt ttctgcagtt cctctaactt cttaggaata 300
223 tcattcttat taatactcat gttaatatta tacagtttca tt 342

226 <210> SEQ ID NO: 7

227 <211> LENGTH: 387

228 <212> TYPE: DNA

229 <213> ORGANISM: Plasmodium falciparum

231 <220> FEATURE:

232 <221> NAME/KEY: CDS

233 <222> LOCATION: (1)..(387)

234 <223> OTHER INFORMATION:
W--> 236 <400> 7

237 atg aag gcg cta ctc ttt ttg ttc tct ttc att ttt ttc gtt acc aaa 48

238 Met Lys Ala Leu Leu Phe Leu Phe Ser Phe Ile Phe Phe Val Thr Lys

239 1 5 10 15

241 tgt caa tgt gaa aca gaa agt tat aag cag ctt gta gcc aac gtg gac 96

242 Cys Gln Cys Glu Thr Glu Ser Tyr Lys Gln Leu Val Ala Asn Val Asp

243 20 25 30

245 gaa ttc aac atc tcg cag cac caa tgc gtg aaa aaa caa tgt ccc gag 144

246 Glu Phe Asn Ile Ser Gln His Gln Cys Val Lys Lys Gln Cys Pro Glu

247 35 40 45

249 aac tct ggc tgt ttc aga cac ttg gac gag aga gag gag tgt aaa tgt 192

250 Asn Ser Gly Cys Phe Arg His Leu Asp Glu Arg Glu Glu Cys Lys Cys

251 50 55 60

253 ctg ctg aac tac aaa cag gag ggc gac aag tgc gtg gag aac ccc aac 240

254 Leu Leu Asn Tyr Lys Gln Glu Gly Asp Lys Cys Val Glu Asn Pro Asn

255 65 70 75 80

257 ccg acc tgt aac gag aac aac ggc ggc tgt gac gca gac gcc aaa tgc 288

258 Pro Thr Cys Asn Glu Asn Asn Gly Gly Cys Asp Ala Asp Ala Lys Cys

259 85 90 95

261 acc gag gag gac tcg ggc agc aac ggc aag aaa atc acg tgt gag tgt 336

262 Thr Glu Glu Asp Ser Gly Ser Asn Gly Lys Lys Ile Thr Cys Glu Cys

263 100 105 110

265 acc aaa ccc gac tcg tac ccg ctg ttc gac ggc atc ttc tgc agc taa 384

266 Thr Lys Pro Asp Ser Tyr Pro Leu Phe Asp Gly Ile Phe Cys Ser

267 115 120 125

269 taa 387

272 <210> SEQ ID NO: 8

273 <211> LENGTH: 127

274 <212> TYPE: PRT

275 <213> ORGANISM: Plasmodium falciparum

277 <400> SEQUENCE: 8

279 Met Lys Ala Leu Leu Phe Leu Phe Ser Phe Ile Phe Phe Val Thr Lys

280 1 5 10 15

283 Cys Gln Cys Glu Thr Glu Ser Tyr Lys Gln Leu Val Ala Asn Val Asp

284 20 25 30

287 Glu Phe Asn Ile Ser Gln His Gln Cys Val Lys Lys Gln Cys Pro Glu

288 35 40 45

291 Asn Ser Gly Cys Phe Arg His Leu Asp Glu Arg Glu Glu Cys Lys Cys

292 50 55 60

295 Leu Leu Asn Tyr Lys Gln Glu Gly Asp Lys Cys Val Glu Asn Pro Asn

296 65 70 75 80

299 Pro Thr Cys Asn Glu Asn Asn Gly Gly Cys Asp Ala Asp Ala Lys Cys

300 85 90 95

303 Thr Glu Glu Asp Ser Gly Ser Asn Gly Lys Lys Ile Thr Cys Glu Cys

304 100 105 110

307 Thr Lys Pro Asp Ser Tyr Pro Leu Phe Asp Gly Ile Phe Cys Ser

308 115 120 125

311 <210> SEQ ID NO: 9 

312 <211> LENGTH: 330 

313 <212> TYPE: DNA 

314 <213> ORGANISM: Plasmodium falciparum

316 <220> FEATURE: 

317 <221> NAME/KEY: CDS 

318 <222> LOCATION: (1)..(330) 

319 <223> OTHER INFORMATION: 
W--> 321 <400> 9

322 gaa aca gaa agt tat aag cag ctt gta gcc aac gtg gac gaa ttc aac 48

323 Glu Thr Glu Ser Tyr Lys Gln Leu Val Ala Asn Val Asp Glu Phe Asn

324 1 5 10 15

326 atc tcg cag cac caa tgc gtg aaa aaa caa tgt ccc gag aac tct ggc 96

327 Ile Ser Gln His Gln Cys Val Lys Lys Gln Cys Pro Glu Asn Ser Gly

328 20 25 30

330 tgt ttc aga cac ttg gac gag aga gag gag tgt aaa tgt ctg ctg aac 144

331 Cys Phe Arg His Leu Asp Glu Arg Glu Glu Cys Lys Cys Leu Leu Asn

332 35 40 45

334 tac aaa cag gag ggc gac aag tgc gtg gag aac ccc aac ccg acc tgt 192

335 Tyr Lys Gln Glu Gly Asp Lys Cys Val Glu Asn Pro Asn Pro Thr Cys

336 50 55 60

338 aac gag aac aac ggc ggc tgt gac gca gac gcc aaa tgc acc gag gag 240

339 Asn Glu Asn Asn Gly Gly Cys Asp Ala Asp Ala Lys Cys Thr Glu Glu

340 65 70 75 80

342 gac tcg ggc agc aac ggc aag aaa atc acg tgt gag tgt acc aaa ccc 288

343 Asp Ser Gly Ser Asn Gly Lys Lys Ile Thr Cys Glu Cys Thr Lys Pro

344 85 90 95

346 gac tcg tac ccg ctg ttc gac ggc atc ttc tgc agc taa taa 330

347 Asp Ser Tyr Pro Leu Phe Asp Gly Ile Phe Cys Ser

348 100 105

351 <210> SEQ ID NO: 10 

352 <211> LENGTH: 108 

353 <212> TYPE: PRT 

354 <213> ORGANISM: Plasmodium falciparum 
356 <400> SEQUENCE: 10 

358 Glu Thr Glu Ser Tyr Lys Gln Leu Val Ala Asn Val Asp Glu Phe Asn

359 1 5 10 15

362 Ile Ser Gln His Gln Cys Val Lys Lys Gln Cys Pro Glu Asn Ser Gly

363 20 25 30 

366 Cys Phe Arg His Leu Asp Glu Arg Glu Glu Cys Lys Cys Leu Leu Asn 

367 35 40 45 
L:39 M:258 W: Mandatory Feature missing, <223> Blank for SEQ#: 1, Line#: 37
L:131 M:258 W: Mandatory Feature missing, <223> Blank for SEQ#: 4, Line#: 129
L:236 M:258 W: Mandatory Feature missing, <223> Blank for SEQ#: 7, Line#: 234
L:321 M:258 W: Mandatory Feature missing, <223> Blank for SEQ#: 9, Line#: 319